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Abstract 

What distribution of graphical degree sequence is invariant under "scal- 
ing"? Are these graphs always power- law graphs? We show the answer 
is a surprising "yes" for sparse graphs if we ignore isolated vertices, or 
more generally, the vertices with degrees less than a fixed constant k. We 
obtain a concentration result on the degree sequence of a random induced 
subgraph. The case of hypergraphs (or set-systems) is also examined. 

1 Introduction 

Quite a few recent papers use the term "scale-free networks" to refer to those 
large sparse graphs formed from real-world data. Such graphs often exhibit 
power-law degree distributions. Namely, the number of vertices with degree d 
is roughly proportional to , for some positive (3. However, the term "scale- 
free" is rarely defined in the literature, at least in the rigorous mathematical 
sense. Furthermore, accounts in the literature of how power laws arise have been 
largely model-dependent. That is, a number of models of random-graph growth 
have been proposed that give rise, under circumstances of varying generality, to 
power-law degree distributions. The most popular growth model of this kind is 
the "preferential attachment" scheme, exemplified by p,[51[^[^. 

Though many of the growth rules are quite intuitive - in that one expects 
many real-world phenomena to approximate them - an explanation of the sheer 
ubiquity of power laws that docs not appeal to particular models is conspicuously 
lacking. 

Here we attempt to address these omissions. First, it is natural to ask, what 
is a scale? An obvious candidate for a scale is the number of vertices of a graph. 
Here "scaling the graph down" means "taking an induced subgraph" . Of course, 
subgraphs may look quite different from one another. Hence, we consider only 
the average behavior. 

Random induced subgraph Gpi For any < p < 1, let Gp be the induced 
subgraph of G on a random subset of vertices S. For each vertex v of G, v is in 
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V{Gp) with probability p independently. 

There are some simple cases that the graph Gp is similar to G. For example, 

• Let G be a complete graph on n vertices. Then Gp is also a complete 
graph on around pn vertices. 

• Let G be an empty graph on n vertices. Then Gp is also an empty graph 
on around pn vertices. 

• For any constant q G (0, 1), let G be the random graph G{n, q). Then Gp 
is also a random graph G{m.. q) over a randomly chosen set of size m ^ pn. 

Crucially, these examples are not "real- world graphs" , in the sense that graphs 
appearing "in nature" tend to be quite sparse. Most vertices have small degrees. 
To characterize this property, we use the following definition: 

For a given sequence {A^j^Q satisfying Y^^=o^'i = 1) with Ad > for all 
d > 0, a sequence of graphs {G"} on n vertices is said to have degree sequence 
with limit distribution {Xd]'dLo number of vertices with degree d in G" 

is A^n + o(n) for each d > 0. We also say that {G"} has limit distribution 
{\d}'dLk^ for X^d>j. Ad < 1, if G" has A^n + o(n) vertices of degree d for each 
d>k. 

We consider two questions. 

1. If the degree sequence of G in {G„} has a limit distribution, for any fixed 
p, does the degree sequence of the random induced subgraph Gp also have 
a limit distribution? 

2. For what distribution {A^j^Q is the limit distribution of the degree se- 
quence of Gp essentially the same as the limit distribution of the degree 
sequence of G? 

To answer the first question, we observe that a vertex of degree an in G would 

badly affect the concentration of the degree sequence of Gp. On the other hand, 
using the vertex-exposure martingale, we can show that the degree sequence of 
Gp will have a limit distribution if 

^deg2(i;) =0(n'-^). 

V 

This condition is satisfied, for example, if G has maximum degree bounded by 

Suppose ao, ai, 02, . . . , is the degree frequency sequence of a graph G, with 
ad representing the number of vertices in G with degree d. What is the degree 
frequency seqiienee of Gp? If a vertex v survives in Gp, its degree has binomial 
distribution B(dG{v),p). There is no simple way to describe the joint distribu- 
tion because of edge-correlations. Nonetheless, the expected degree frequency 
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sequence for Gp is easy to compute. Let 60,61,62, .. . be the expected degree 
frequency sequence of the random induced subgraph Gp. We have 

hd^vY^aJ^^il-pf-" (1) 

k>i ^ ^ 

for all c? = 0, 1, 2, . . .. Note that {hd]d>G depends linearly on {ad]d>{)- We can 
therefore normalize both sequences by dividing by n. 

Therefore, from now on, we assume are the fraction of numbers of vertices 
with degree i in graph G. More precisely, we consider a sequence of graphs G„, 
such that the number of vertices with degree d in Gn is a^n + o(n). We only 
consider sparse graphs such that 

E«' = i- (2) 

It is worth remarking that this manuscript can be read, in effect, as a re- 
sponse to the well-known Stumpf, Wiuf, and May paper, "Subnets of scale-free 
networks are not scale- free: Sampling properties of networks" ([3DJ) and its au- 
thors' related publications. Although the present authors became aware of this 
work only after discovering the results below, it is clear that there is a very 
strong resemblance to the work of Stumpf, Wiuf, and May. However, we of- 
fer the counter-assertion "Subnets of scale-free networks are scale-free, as long 
as one ignores suitably small-degree vertices." We also take a somewhat dif- 
ferent tack by studying, in particular, the asymptotic conditions under which 
scale-freeness holds. 



2 Scale-Free Degree Sequences 

Let A{x) = YlTLa o-ix'' be the generating function of {ai}i>o and B{x) = 
J^iLo ^i^^ ^'^ generating function of {bi}i>o- Both A{x) and B{x) converge 
on [-1,1]. 
We have 



B{x) = ^6,a;* 

i=0 

i=0 k>i ^ ^ 

00 fc ... ^ K ... 

= ^ E «^ E (M^ - pr--' + • E E (M^ - p) 

k^O i^O ^ ^ k^O i-n ^ ^ 

00 CO 

= p^ak{l-p + px)'' + o{l)"^{l-p + px) 

k=0 

= pA{l — p + px) 



k=a k=0 
0(1) 



k^^^^ 
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Scale-free degree sequence starting at 0. 

A naive way to define scalc-freeness is to require 

h = f{p)a^ + o(l) for all ^ > 0, (3) 

where f{p) is a quantity depending only on p. 

Equivalently, for any x € [—1, 1] and p e (0, 1), we have 

pAil-p + px) = f{p)A{x). (4) 

To solve equation (g]), let a; = 1. We get pA{l) ^ f{p)A{l). Thus f{p) = p. We 
have 

A{1 - p + px) ^ A{x) . (5) 
Let a; = 0. We have A{Q) = A{l-p). Therefore, 

^'(0) = lin.^M^ 

x^O X 

A{l-p + px) - A{l-p) 
= nm 

x^O X 

= pA'{l~p). 



Since this holds for any p e (0, 1), we have 
A{p) = AiO) + 



f A'{l-p)dp 
Ji-p 



Thus, 
We have 



Jl^p p 

= A{0)^A'{0)ln{l-p). 
A{x)=A{0)-A'{0)\n{l~x). 

A{l-p + px) = A{0)~ A'{0)lii{p-px) 

= A(0)- A'(0)(lnp + ln(l-.T)) 
= A{x)- A'{0)lnp. 

This forces A'{0) = 0. The only solution for equation (g]) is A{x) = A{0) (the 
constant function, corresponding to a graph with no edges). This solution is 
not interesting. 

Scale- free degree sequence starting at 1. 
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In many cases, we do not care about the number of isolated vertices. We 
only require that 

bd^ fip)ad + o{l) foralld>l. (6) 

where f{p) is a quantity depending only on p. 

Equivalently, for any p £ (0, 1) and x G [—1, 1], we have 

f{p){A{x) - A{0)) - p{A{l -p + px)- A{1 - p)). (7) 

Take the derivative with respect to x on both sides. We have, for any p E (0, 1) 
and X e (—1, 1), 

f{p)A'{x)^p^A'il-p + px). (8) 

Let a = /p ^^dp be a positive constant. Divide both sides of equation ^ by 
and integrate it with respect to p from to 1. We have 

aA' (x) = / A' {1 — p + px)dp 



Rewriting this expression. 



^(1) - M^) 

l-x 
1 - A{x) 

l-x 



A'(x) _ 1 



1-A{x) a{l-x)' 
Now, integrate with respect to x from to x. We get 



(9) 



Therefore, we have 

A{x) = l~{l~A{0)){l-x)^. (11) 

It is easy to verify that equation (fTTjl satisfies equation ([8]) with f{p) = pi^^. 

We do not care about A{Q) — ao, the number of isolated vertices. Hence, 
the solution is uniquely determined by a parameter a up to a a constant factor. 
For d > 1, we have 

1 \ 

d+l 



<^d = (l-ao)(^;^j(-l) 

= o(d-(^+i)). 

In other words, the degree frequency sequence follows a power- law distribution 
with exponent (3 — 1 + l/a. However, not all are positive. Particularly, if 
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/3 > 2, then there are negative terms a^^ d > 1. 

Scale-free degree sequence starting at k. 

Now we assume that the degree sequence distribution, considering only de- 
grees at least k, is scale-free. That is, 

bd^ f{p)ad + o{l) foralld>fc. (12) 

where f{p) is a quantity depending only on p. 

Or equivalently, for any p € (0, 1) and x G [— 1, 1], we have 

k — l k — 1 / 1 \ 

miAix) - J2 ^dx') = piAil -p + px)-Y, ^dx'' ( J/(l - p)''-^). (13) 

Take the k-ih derivative with respect to x on both sides to get rid of all terms 
of degree up to fc — 1. We have, for any p S (0, 1) and x € (—1, 1), 

/(p) AC^) ix) = AC^) il -p + px). (14) 

Let Qffc = jj^ ^J,^i dp. Similar arguments to those above show that the solution 
of equation (fT4|) is of form 

A'^-I(.t) =Ci-C2(l-a;)". 
If we then integrate with respect to x A: — 1 times, the result is 

Aix)=Pkix)-Cil-x)^^+\ (15) 

Here Pkix) is a polynomial of x with degree fc — 1. It is easy to verify that 
equation (fT5|) is the solution of equation (fT3|l with /(p) = Let /? — -^+k. 

For any d> k, we have 



If we set 




then the aa are positive for d> (3. Note that sgn(C/3) = (— 1)L''J. 



3 Concentration 

Since we know that the only degree sequences which are scale- free in expectation 
have power-law limit distributions, it is crucial to show that such graph have 
degree sequences which are close to their means with high probability. 
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Theorem 1. Suppose that {G"}^]^ is a sequence of graphs on n ^ oo vertices 
with degree sequence of limit distribution {A^j^j,. Further suppose that 

^deg(^;)2=0(n2-) 

for some e > 0. Then the degree sequence of also has a limit distribution 

Proof. Let Ud = ad{n) be the fraction of vertices of degree d in G" and let 
bd = bd{n) be the fraction of vertices of degree d in G^. Clearly it suffices to 
show that bd is concentrated about its expectation. 

To that end, wc apply the Azuma-HocfFding inequality to the "vertex expo- 
sure" martingale. In particular, consider the following process. Fix d> k, order 
the vertices of G" as fi, . . . , w„, and let denote the event that Vm & G^. Let 
Xq = E[6f;n], and let Xm+i = 'E[X,fn\A,fn+i]. That is, at stage m, wc "expose" 
vertex m and recalculate the expected number of vertices of degree d based on 
the new information concerning whether or not Vm & Gp. It is easy to see that 
this is a martingale, and, furthermore, that — Xrn\ < deg(?;„i_|_i) + 1, 

where deg(-) denotes degree in G. Since bdU = X„, we may apply the Azuma- 
Hoeffding inequality to get 

P [\bd - Xd\ > t/n] < exp (2^;;^^^(d;gK) + l)0 
for t > 0. Since X;l^=i deg{vm)'^ = ©(n^"^) and 

n / n \ 1/2 

^ deg(^;„) <V^iY. degC^'™)' = 0{n'/^-'/^) 

m— 1 \m— 1 / 

by Cauchy-Schwarz, wc can set t = n^""^/^, getting 

P[|&d-Ad|>t/n]<e-"("^''). (17) 
Let t' = t/n = rr'^l^. Then, since 



n=l 



.d=fe 



oo 

e/2 



< ^ ne < oo. 



n=l 



the Borel-Cantelli Lemma implies that asymptotically almost surely, |6d — A^l < 
t' = o{n) for all d> k. 

□ 

We have the following theorem. 

Theorem 2. For any integer k > (3 > 1, the degree sequence starting at k 
defined by ad = G/3('^^'^)n + o(n) is scale-free. Moreover, if a graph G on n 
vertices such that 

deg{v)^ = 0(n2-^) 
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for some e > has a scale- free degree sequence starting at k, then there is a 
(3 G (1, k) so that aa = Cp['^~^^)n + o{n). As a consequence, sparse graphs with 
scale-free degree sequence are power-law graphs. 

4 Scale-free set system 

Many power-law graphs like the Collaboration Graph and the Hollywood Graph 
are actually better modeled by set systems (or hypergraphs) rather than graphs. 
For example, in the Math Reviews database, each published item has one or 
more authors. The family of all papers considered as collections of authors 
forms a set system. The Collaboration Graph only captures part of the infor- 
mation in this set system. Here we quote from the Erdos number project |20j : 

There are about 1.9 million authored items in the Math Reviews 
database, by a total of about 4-01,000 different authors. ... Approxi- 
mately 62.4-% of these items are by a single author, 27.4% au- 
thors, 8.0% by three authors, 1.7% by four authors, 0.4% by five authors, 
and 0.1% by six or more authors. 

In this example, the distribution of set-sizes follows a power-law distribution. 
Is this just a coincidence? Is "scale-free" distribution of a set system always a 
power-law distribution? 



number of authors in a paper 



Figure 1: The precentage of multiple-author-paper in AMS Review database. 

Motivated by this example and "scale-free" graphs, we consider the following 
problem. For a set system J- and any probability p G (0, 1), the random sub-set- 
system J-p is chosen by independently removing vertices with probability 1 — p 
and reducing the sets to their remaining elements. 

Problem 1. For what sequence of set- sizes in a set system T, is the sequence 
of the set-sizes in random sub-set-system J-p essentially the same as the original 
sequence up to a scale? 

For i > 1, let a, be the number of «-sets in and &, be the number of i-sets 
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in J^p. We are asking if there is a function f{p) such that 

bi = f{p)ai + o{n) 

for all i > k. Here fc is a small positive integer. 
Since the expected value E(6i) satisfies 



It is necessary to have 

i 



m)^T.^,l^l^p'{^-py-'- (18) 



Y,a,['^)p\^~Py~'^fip)a^ (19) 

j>i 



for all i > k. 

Let A{x) — J2i o-i^^ be the generating function. For any p E (0, 1) and 
X E [— 1 , 1] , we have 

k~l k—1 /] \ 

f{p)iAix) - J2 "-i^") = (^(1 -p+p^)-T. idjp'^^ - p)'"')- (20) 

This is essentially the same equation as equation Thus we have the fol- 

lowing theorem. 

Theorem 3. // the sequence of set-sizes in a set-system starting at k > 1 is 
scale-free, then there are constants (3 £ (l,fc) and C such that the number of 
i-sets in this set-system is Cp{^~^^n + o(n) for all i > k. 



5 Remarks and questions 

Note that the results of the preceding sections have a probabilistic interpreta- 
tion. Suppose that, for each n, we have a probability distribution Q over graphs 
on n vertices with the property that the expected number of vertices of degree 
d is Od- Then, what must E[ad] be if, when G is sampled from Q and a random 
subgraph Gp is taken, the expected number bd of vertices of degree d after scal- 
ing so that Od = X^d is the same as ad? The above analysis provides the 
answer: the expectation of Od must be a power law in d. 

Now, it is natural to ask, if the variance of the bd is scaled as the square 
of the scaling factor for the expectations, then what must a^{ad) be? In fact, 
one can ask the same question of all moments, leading to the following open 
problem: 

Problem 2. Fix p e (0, 1). Let G be drawn from a probability distribution Q 
on graphs with n vertices. Suppose that Od, d > 0, is the number of vertices of 
degree d in G, and bd, d>Q, is the number of vertices of degree d in Gp. For 
which distributions Q is it true that there exists some c{p) S M so that {ad}d>k 
and {c{p)bd\d>k have approximately the same distribution for large n? Is it 
possible to find such Q for all p G (0, 1) simultaneously? 
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Currently, the exponents of "real- world" seale-frce networks' power laws is 
estimated in a rather ad-hoc fashion, usually using a regression on the log-log 
plot of frequency vs. degree after removing the extremes of the data. If it were 
possible to describe scalc-frec distributions exactly, then it would make sense to 
ask the following very practical question: 

Problem 3. Find an unbiased estimator for the exponent of a power-law degree 
distribution. 

For the matter of the variance of the a^, we note that, at least for /3 e (1,2), 
the following must be true: 

This statement can be proven by applying the formula 

JV 

a\^Xi) = E[Xi]V(iV) +E[iV]a2(Xi) 

i=l 

for i.i.d. variables Xi and an independent variable N taking on nonnegative 
integer values. 

We also ask, what can be proved by extending the definition of scale-freeness 

to hypcrgraphs? We believe that the situation is very similar to that of graphs 
when the hypergraphs being considered are uniform (with edges removed when- 
ever at least one of their vertices is removed). Perhaps the answer lies in a 
more refined description of scalc-frecness. For example, consider the quantity 
Off (G), the number of occurrences of H as an induced subgraph of G. Suppose 
that aH{G)/n — > aR for each H and some an € K"*", and that this sequence is 
scale- free, i.e., 

aniGp) cx (G) 

for any fixed p with < j? < 1 and H varying over all graphs on at least k 
vertices. Then what must G look like? 
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